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We present a method for enhancing the cross-correlation of gravitational wave 
signals eventually present in data streams containing otherwise uncorrelated noise. 
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£NJ ' Such method makes use of the wavelet decomposition to cast the cross-correlation 



time series in time-frequency space. Then an entropy criterion is applied to identify 
the best time frequency resolution, i.e. the resolution allowing to concentrate the 
signal in the smallest number of wavelet coefficients. By keeping only the coefficients 



CN ■ above a certain threshold, it is possible to reconstruct a cross-correlation time series 
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where the effect of common signal is stronger. We tested our method against signals 
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^ | injected over two data streams of uncorrelated white noise. 
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*U ■ I. INTRODUCTION 

Gravitational waves (GW) are expected to be emitted by a variety of cosmological and 
astrophysical sources and several detectors are now operating to directly observe such a 
signal. 

Some sources has been thoroughly studied and the expected GW-signal they emit is well 
modeled, like binary coalescences (without spin) and spinning neutron stars. For this class 
of sources the standard, matched-filtering techniques can be adopted, which are able to dig 
well under the noise floor. However some astrophysical systems may emit signals which 



* Electronic address: 



roberto.terenzi@cern.ch, riccardo.sturani@uniurb.it 



2 



are poorly modeled at the moment, not to mention unknown sources. This is the case for 
spinning coalescing systems, stellar core collapse events, cosmological stochastic signal, to 
mention just a few, known cases. 

In ordered to be able to spot these unmodeled signals it is necessary to use a wide net, 
capable of detecting any suspect excess noise in the data streams. Following this strategy 
several burst searches are performed by the LIGO/VIRGO [1] as well as by the resonant 
antennas network , in which the technique of cross-correlation of data streams between 
two (or more) detectors is employed, possibly jointly with other techniques, to identify 
GW-event candidate, see e.g.0, si, B]. The cross-correlation analysis consists in obtaining 
a suitable defined data stream out of two detector data streams, and check for anomalous 
excesses in it. Cross-correlation can also be used in combination with matched filtering 
technique. 

Here we present a method to enhance to capability of cross-correlating techniques to 
detect GW-signals. Our method is based on a wavelet decomposition of the cross-correlation 
data stream. Wavelet decomposition have been already adopted in GW-data analysis, see 
e.g. jsl, E3]. In particular the WaveBurst algorithm 10, U] has been employed in the the 
LIGO/VIRGO burst search. 

Wavelet decomposition is a time-frequency decomposition. Unlike in the familiar case 
of Fourier Transform, in wavelet decomposition the time and frequency resolution is a free 
parameter (though the time and frequency resolutions are not independent on each other). 
The method described in the following section is able to adaptively select the decomposition 
level, or time-frequency resolution, which is best for the generic data stream analyzed. In 
particular this method, presented in a different context in [ijj], picks the resolution which 
gathers the power of the data stream in the smallest possible number of wavelet coefficients, 



enhancing the possibility of singling out excesses o 



is obtained by considering an entropy function 13], which is maximum when the power of 



cross-correlation. The resolution choice 



the signal is equally distributed among all the coefficients and minimum when all the power 
is concentrated in only one coefficient. Once the best resolution has been individuated, 
by a process of thresholding the detection of correlation excesses can be enhanced with 
respect to the analysis obtained by simply cross-correlating the data streams without further 
processing. 

The paper is organized as follows: in sec. [Ill we describe the method, in sec. II III we 
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report the results obtained by analyzing a stream of white data sampled at 5kHz containing 
injections of short- duration signals. Sec. IIVI contains the conclusion of our analysis. 

II. THE METHOD: WAVELET PACKET AND ENTROPY FILTER 

A wavelet transform is used to transform a time series into a mixed time-frequency one. 
Given an initial series made of N data points, with sampling Ato, it contains information 
on frequencies ranging from zero up to the Nyquist frequency f m ax = (2A£o) _1 - Different 
decomposition levels are available in a wavelet transform, labeled by a (limited) integer 
number j. At each level the wavelet-transformed data stream contains N coefficients as 
the initial time series, arranged in 2 J layers, each with N x 2 _ - J coefficients. At level j the 
time resolution is Atj = 2- ? At and the frequency resolution is Afj = 2~- ? f max = 1/(2 Atj). 
The coefficients in the i th layer (0 < i < 2- 7 ) refer to the frequency bins characterized by 

i2-> < f I fmax <{l + 1)2^'. 

The output of a wavelet transform can be arranged in a binary tree where each node is 
an orthogonal vector subspace at level j and at layer i. Together the layers of the same 
level contain exactly the same information of the original time series, so that complete 
reconstruction of the original signal is possible by collecting the coefficients of any level. 
Anyway this is not the only possible choice to reconstruct the original time-signal: it can 
be shown (see e.g. [7|) that other admissible trees T completely represents the original 
signal in the wavelets domain, where an admissible tree is a sub-tree of the original binary 
decomposition tree where every node has exactly or 2 children nodes, see e.g. fig. [T], where 
the nodes of an example of an admissible tree are marked by red circles. We denote an 
admissible tree by T{k,i}, as it implies a choice of a set of pairs of level/layers. 
Having such a redundancy, how to choose the set of subspaces to represent the signal? The 
standard choice is to take all the layers of a single decomposition level, but even in this case 
one has the freedom to pick among several levels. 

The way we propose to choose in the set of admissible trees T^,i} is based on an entropy 
function E T defined as follows: 

ije{fc,z} T T 
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where the x^ are the wavelet coefficients of a level j and layer i and 

11*11?.= £ 4- ( 2 ) 

i,je{k,l} 

From the complete decomposition tree, we select among all the possible admissible trees 
the one which have the minimum cost, i.e. the T for which Ef is minimum, following the 
entropy cntenon (Q where a different notation is used). FVom among all the admissible 
binary trees, T is the one that represents the signal most efficiently, as it uses the least 
possible number of wavelet coefficients. 

For instance in the case in which only one of the x^s is non- vanishing (maximal concentra- 
tion) the entropy function is vanishing. On the other hand, if for some tree the decomposition 
coefficients are all equal, say Xij — 1/N, the entropy in this case is maximum, logiV. 




Figure 1: Schematic example of a binary tree for a wavelet decomposition up to level 3. The layers in 
each level are explicitly labeled for j < 3. The red circles show an example of a possible choice for a 
tree. In this examples it is made by choosing the subspaces with {j, i} = {(2, 0), (3, 2), (3, 3), (1, 1)}, 
from 1 121. 



After having performed the wavelet transform (a Symlet basis has been used) and the 
evaluation of the entropy function to determine the best tree, for each node in the resulting 
tree we set a threshold 0y equal to a fraction of the value of the maximum wavelet coefficient 
Wij in the node. Then a soft threshold function has been applied, i.e. each coefficient in the 
node has been changed according to the following function /: 

f(w itj (k)) =0 if Wij(k) < 6 itj 

f(w itj (k)) = w itj (k) - 9 id if Wi tj (k) > 0y 
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where k (0 < k < N2~i) runs over the coefficients of the layer. 

Then the time series has been recomposed from this mutilated wavelet coefficient set, 
where spurious disturbances (non concentrated in time and frequency) have been epurated. 
This will allow, as it is shown in the next section, to single out more clearly the excess in 
correlation due to actual signals. The wavelet decomposition has then been used to apply a 
non-linear filter to the r-series. 

III. RESULTS 

The cross correlation r of two time series {xt}, {yi}, each of length N, is given by 

r = E?=i(xi -x){yi -y) ^ 

where an overbar stands for average value. Such quantity measures the correlation between 
two data streams, as it would be produced by a common GW signal embedded in uncorre- 
cted detector noise |4| and it compares waveforms without being sensitive to their relative 
amplitude. In real cases, data are usually filtered and/or whitened before applying the cross- 
correlation analysis. We have simulated two series of uncorrelated, white noise data. To 
test if our method is able to enhance the cross correlation output we have injected common 
signals of different amplitude and shape into the two data streams. For simplicity we have 
concentrated our attention on one kind of signal, an exponential-sine, whose time profile is 
given by: 

h{t) = h sin(2vr/ t)e^ /r t > , (5) 

(and h(t) = for t < 0) for two different amplitudes, central frequency fo = 920Hz and r = 
10,100msec. The amplitudes have been chosen so to reproduce an SNR = 0.47,0.71,2.24 
and 1.48, where the SNR is defined as 

SNR = a s /a n , (6) 

where a n is the noise standard deviation and a s the signal standard deviations computed 
over a 5r period from the injection on. The correlation time-windows (determined by iV 
in eq. (j4j) and by the sampling time) are taken to be respectively 100, 500 msec for the 
t = 10, 100 msec injections. 



6 



T=100ms, SNR=0.71, Xcorr at injection points 
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Figure 2: Example of the cross-correlation of data streams for identical injections of the type © 
with /o = 920Hz,r = 100 msec, correlation time-window of 500 msec, SNR = 0.71, as defined in 
eq. ([6]). The figure displays the maximum value of the r-coefficient for each injection (separated 
by 4 sees), before (red) and after (black) the wavelet entropy filter has been applied. 



As an example, we report in fig. [2] the values of the maximum of the r-coefficients in 
correspondence of a series of injections (jSJ). 

In order to quantify the enhancement due to the entropy filter we define a gain factor 



141 ] of the r-coefficient for the data 



by considering, for each injection, the maximum value 
before (f) and after (rV) the treatment with the wavelet entropy filter and then define the 
quantity 

f w — f 



gam 



(7) 



Figs. [M6] show that for different choice of the injection parameters the wavelet entropy filter 
thus indeed lead to a statistical enhancement of the cross-correlation, the enhancement being 
stronger for stronger signals. 

In order to show that this method let excesses of correlation step out of the noise floor 
more clearly, while not introducing any significant increase of false alarm rate, we have eval- 
uated the average and standard deviation of the maximum of the r-coefficients obtained by 
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T=100ms, SNR=0.47 




Xcorr 



Figure 3: Histogram of gain factor, as defined by eq. ([7]) and correspondent average fi and standard 
deviation a. Parameters SNR and r of the injections are displayed, /o = 920Hz. 
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Figure 4: Histogram of gain factor. Parameters SNR and r are displayed, fo = 920Hz. 
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Figure 5: Histogram of gain factor. Parameters SNR and r are displayed, fo = 920Hz. 
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Figure 6: Histogram of gain factor. Parameters SNR and r are displayed, /o = 920Hz. 
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Correlation time-window 
(msec) 


r 


r w 


A* 


a 




a 


100 


0.21 


0.14 


0.21 


0.15 


500 


0.041 


0.081 


0.040 


0.075 



Table I: Background mean value /x and standard deviation a of the maximum cross-correlation 
coefficients obtained from white data (no injections) before (f) and after (fw) the application of 
the entropy filter. The maximum is computed over a time-interval of 200 msec. 







r = 10 msec 


t = 100 msec 






SNR=2.24 


SNR=1.48 


SNR=0.71 


SNR =0.48 


4a 


#r W 


127 


14 


153 


38 




#f 


84 


5 


84 


2 


3a 


#r w 


172 


91 


172 


73 




#f 


169 


70 


151 


28 



Table II: Number (out of 175 injections) of r and f^y-coefficients at injection time exceeding by 
3a and 4a the mean value of the r-coefficients of data without injections. The time-window for 
correlation are taken 100, 500 ms for r = 10, 100 ms respectively. 

cross-correlating white data without any injection before and after the application of the 
entropy filter. The result summarized in tab. d shows that in absence of signal the applica- 
tion of the wavelet entropy filter is irrelevant. 

On the other hand, analyzing the cross-correlation time series obtained by data with injec- 
tions, and counting how many f, f ^-coefficients have exceeded the average by more than 3a 
and 4a (/i and a reported in tab.flj, we have obtained the results reported in tab.flll showing 
that the wavelet entropy filter is powerful in enhancing the detection of signals embedded 
into noise. 

IV. CONCLUSIONS 

We have presented a method that makes use of a wavelet decomposition to determine 
which part of the cross-correlation time series can be zeroed without loosing interesting 
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feature of the signals, thus reducing the impact of noise and we showed that this method 
can enhance the detection effciciency of signal embedded into noise while not increasing the 
false alarm rate. We plan to release the code to the gravitational wave community in a near 
future. 
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